Assessing the predictive performance of machine learners in software defect prediction

نویسندگان

  • Martin Shepperd
  • D. Bowes
  • D. Gray
چکیده

We present a methodology for Bayesian analysis of software quality. We cast our research in the broader context ofconstructing a causal framework that can include process, product, and other diverse sources of information regarding fault introductionduring the software development process. In this paper, we discuss the aspect of relating internal product metrics to external qualitymetrics. Specifically, we build a Bayesian network (BN) model to relate object-oriented software metrics to software fault content and faultproneness. Assuming that the relationship can be described as a generalized linear model, we derive parametric functional forms for thetarget node conditional distributions in the BN. These functional forms are shown to be able to represent linear, Poisson, and binomiallogistic regression. The models are empirically evaluated using a public domain data set from a software subsystem. The results showthat our approach produces statistically significant estimations and that our overall modeling method performs no worse than existingtechniques.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Benchmarking Machine Learning Technologies for Software Defect Detection

Machine Learning approaches are good in solving problems that have less information. In most cases, the software domain problems characterize as a process of learning that depend on the various circumstances and changes accordingly. A predictive model is constructed by using machine learning approaches and classified them into defective and non-defective modules. Machine learning techniques hel...

متن کامل

Application of genetic algorithm (GA) to select input variables in support vector machine (SVM) for analyzing the occurrence of roach, Rutilus rutilus, in streams

Support vector machine (SVM) was used to analyze the occurrence of roach in Flemish stream basins (Belgium). Several habitat and physico?chemical variables were used as inputs for the model development. The biotic variable merely consisted of abundance data which was used for predicting presence/absence of roach. Genetic algorithm (GA) was combined with SVM in order to select the most important...

متن کامل

A Feature Selection Based Model for Software Defect Prediction

Software is a complex entity composed in various modules with varied range of defect occurrence possibility. Efficient and timely prediction of defect occurrence in software allows software project managers to effectively utilize people, cost, time for better quality assurance. The presence of defects in a software leads to a poor quality software and also responsible for the failure of a softw...

متن کامل

TUNNEL BORING MACHINE PENETRATION RATE PREDICTION BASED ON RELEVANCE VECTOR REGRESSION

key factor in the successful application of a tunnel boring machine (TBM) in tunneling is the ability to develop accurate penetration rate estimates for determining project schedule and costs. Thus establishing a relationship between rock properties and TBM penetration rate can be very helpful in estimation of this vital parameter. However, this parameter cannot be simply predicted since there ...

متن کامل

An Efficient Predictive Model for Probability of Genetic Diseases Transmission Using a Combined Model

In this article, a new combined approach of a decision tree and clustering is presented to predict the transmission of genetic diseases. In this article, the performance of these algorithms is compared for more accurate prediction of disease transmission under the same condition and based on a series of measures like the positive predictive value, negative predictive value, accuracy, sensitivit...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013